SEQUENCE LISTING 



(1). GENERAL INFORMATION: 

(i) APPLICANT: Barbas III, Carlos F. 

Gottesfeld, Joel M. 
Wright, Peter E. 

(ii) TITLE OF INVENTION: ZINC FINGER PROTEIN DERIVATIVES 
AND METHODS THEREFOR 

(ill) NUMBER OF SEQUENCES: 62 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.C. 

(B) STREET: 4225 Executive Square, Suite 1400 

(C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 92037 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 95 

(D) SOFTWARE: FastSEQ for Windows Version 2.0b 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/863,813 

(B) FILING DATE: 27-MAY-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/676,318 

(B) FILING DATE: 18-JUL-1996 

(A) APPLICATION NUMBER: 08/183,119 

(B) FILING DATE: 18-JAN-1996 

(A) APPLICATION NUMBER: US95/00829 

(B) FILING DATE: 18-JAN-1995 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Haile, Lisa A., Ph.D. 

(B) REGISTRATION NUMBER: 38,347 

(C) REFERENCE/DOCKET NUMBER: 08401/010001 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619/678-5070 

(B) TELEFAX: 619/678-5099 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(B) LOCATION: 1...1 

(D) OTHER INFORMATION: wher Xaa at position is Tyr or 



(B) LOCATION: 2, 4-7, 9-11, 13-17, 19, 22-25, and 27-32 
(D) OTHER INFORMATION: where Xaa at the above positions is Ala, Arg 
Asn, Asp, Cys, Glu, Gin, Gly, His, He, Leu, Lys, Met, Phe, Pro, Ser, Thr, 
Trp, Tyr, or.Val; Xaa at positions 6 or 7 may be missing; Xaa at position 25 
may be missing; Xaa at positions 29, 30, 31, or 32 may be missing 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Xaa Xaa Cys Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa 

1 5 10 15 

xaa Leu Xaa Xaa His Xaa Xaa Xaa Xaa His Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZF 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ATGAAACTGC TCGAGCCCTA TGCTTGCCCT GTCGAG 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GAGGAGGAGG AGACTAGTGT CCTTCTGTCT TAAATGGATT TTGGT 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: zif 268Xho-Spe 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...273 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



CTC GAG CCC TAT GCT TGC CCT GTC GAG TCC TGC GAT CGC CGC TTT TCT 48 
Leu Glu Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg Arg Phe Ser 
15 10 15 

CGC TCG GAT GAG CTT ACC CGC CAT ATC CGC ATC CAC ACA GGC CAG AAG 96 
Arg Ser Asp Glu Leu Thr Arg His He Arg He His Thr Gly Gin Lys 
20 25 30 

CCC TTC CAG TGT CGA ATA TGC ATG CGT AAC TTC.AGT CGT AGT GAC CAC 144 
Pro Phe Gin Cys Arg He Cys Met Arg Asn Phe Ser Arg Ser Asp His 
35 40 45 

CTT ACC ACC CAC ATC CGC ACC CAC ACA GGC GAG AAG CCT TTT GCC TGT 192 
Leu Thr Thr His He Arg Thr His Thr Gly Glu Lys Pro Phe Ala Cys 
50 55 60 

GAC ATT TGT GGG AGG AAG TTT GCC AGG AGT GAT GAA CGC AAG AGG CAT 240 
Asp He Cys Gly Arg Lys Phe Ala Arg Ser Asp Glu Arg Lys Arg His 
65 70 75 80 

ACC AAA ATC CAT TTA AGA CAG AAG GAC ACT AGT 273 
Thr Lys He His Leu Arg Gin Lys Asp Thr Ser 
85 90 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Leu Glu Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg Arg Phe Ser 

15 10 15 

Arg Ser Asp Glu Leu Thr Arg His He Arg He His Thr Gly Gin Lys 

20 25 30 

Pro Phe Gin Cys Arg He Cys Met Arg Asn Phe Ser Arg Ser Asp His 

35 40 45 

Leu Thr Thr His He Arg Thr His Thr Gly Glu Lys Pro Phe Ala Cys 

50 55 60 

Asp He Cys Gly Arg Lys Phe Ala Arg Ser Asp Glu Arg Lys Arg His 
65 70 75 80 

Thr Lys He His Leu Arg Gin Lys Asp Thr Ser 
85 90 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: FTX3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GCAATTAACC CTCACTAAAG GG 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: BZF3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GGCAAACTTC CTCCCACAAA T 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZF36K 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ATTTGTGGGA GGAAGTTTGC CNNKAGTNNK NNKNNKNNKN NKCATACCAA AATCCATTTA 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: R3B 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TTGATATTCA CAAACGAATG G 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid ^ 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: 2FNsi-B 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CATGCATATT CGACACTGGA A 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZF2r6F 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CAGTGTCGAA TATGCATGCG TAACTTCNNK NNKNNKNNKN NKNNKACCAC CCACATCCGC 
ACCCAC 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: AFl6rb 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CTGGCCTGTG TGGATGCGGA TATGMNNMNN MNNMNNMNNC GAMNNAGAAA AGCGGCGATC 
GCAGGA 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : 8 ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZFIF 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CATATCCGCA TCCACACAGG CCAG 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 



(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Arg Ser Asp Glu Leu Thr Arg His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15i 

Ser Arg Ser Asp His Leu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
CGTAAATGGG CGCCCTTTTG GGCGCCCATT TACG 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

Arg Ser Asp Glu Arg Lys Arg His 
1 5 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Trp Ser lie Pro Val Leu Leu His 
1 - 5 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Trp Ser Leu Leu Pro Val Leu His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Phe Ser Phe Leu Leu Pro Leu His 
1 5 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Leu Ser Thr Trp Arg Gly Trp His 
1 5 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Thr Ser lie Gin Leu Pro Tyr His 
1 5 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

TGATCTCAGA AGCCAAGCAG GGTCGGGCCT GGTTAGTACT TGGATGGGAG ACCGCCTGGG 60 
A 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Tvr He Cys Ser Phe Ala Asp Cys Gly Ala Ala Tyr Asn Lys Asn Trp 

1 5 10 15 

Lys Leu Gin Ala His Leu Cys Lys His Thr 
20 25 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Phe Pro Cys Lys Glu Glu Gly Cys Glu Lys Gly Phe Thr Ser Leu His 

1 5 10 15 

His Leu Thr Arg His Ser Leu Thr His Thr 
20 25 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Phe Thr Cys Asp Ser Asp Gly Cys Asp Leu Arg Phe Thr Thr Lys Ala 

15 10 15 

Asn Met Lys Lys His Phe Asn Arg Phe His 
20 25 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 bas pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TGGATGGGAG ACC 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Arg Ser Asp Glu Arg Lys Arg His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GTCCATAAGA TTAGCGGATC C 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GTGAGCGAGG AAGCGGAAGA G 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CCTGCGTGGG CGCCCTTTTG GGCGCCCACG CAGG 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 ine ar 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(B) LOCATION: 4,«.4 

(D) OTHER INFORMATION: where Xaa at position 4 is Lys or Pro 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



Thr Gly Glu Xaa 
1 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME /KEY: Coding Secjuence 

(B) LOCATION: 1...459 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

ATG CTC GAG CTC CCC TAT GOT TGC CCT GTC GAG TCC TGC GAT CGC CGC 48 
Met Leu Glu Leu Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg Arg 
15 10 15 

TTT TCT CGC TCG GAT GAG CTT ACC CGC CAT ATC CGC ATC CAC ACA GGC 96 
Phe Ser Arg Ser Asp Glu Leu Thr Arg His He Arg He His Thr Gly 
20 25 30 

CAG AAG CCC TTC CAG TGT CGA ATA TGC ATG CGT AAC TTC AGT CGT AGT 144 
Gin Lys Pro Phe Gin Cys Arg He Cys Met Arg Asn Phe Ser Arg Ser 
35 40 45 

GAC CAC CTT ACC ACC CAC ATC CGC ACC CAC ACA GGC GAG AAG CCT TTT 192 
Asp His Leu Thr Thr His He Arg Thr His Thr Gly Glu Lys Pro Phe 
50 55 60 

GCC TGT GAC ATT TGT GGG AGG AAG TTT GCC AGG AGT GAT GAA CGC AAG 240 
Ala Cys Asp He Cys Gly Arg Lys Phe Ala Arg Ser Asp Glu Arg Lys 
65 70 75 80 

AGG CAT ACC AAA ATC CAT ACC GGR CAG AAG CCC ACT AGT GGC GGT GGT 288 
Arg His Thr Lys He His Thr Xaa Gin Lys Pro Thr Ser Gly Gly Gly 
85 90 95 



o 

ilJ 



CGG ATC GCC CGG CTG GAG GAA AAA GTG AAA ACC TTG AAA GCG CAA AAC 336 
Ara lie Ala Arg Leu Glu Glu Lya Val Lys Thr Leu Lys Ala Gin Asn 
100 105 110 

TCC GAG CTG GCG TCC ACC CGG AAC ATG CTC AGG GAA CAG GTG GCA CAG 384 
Ser Glu Leu Ala Ser Thr Arg Asn Met Leu Arg Glu Gin Val Ala Gin 
115 120 125 

CTT AAA CAG AAA GTC ATG AAC CAC GCT AGC GGC CAG GCC GGC CAG TAC 432 
Leu Lys Gin Lys Val Met Asn His Ala Ser Gly Gin Ala Gly Gin Tyr 
130 135 140 

CCG TAC GAC GTT CCG GAC TAC GCT TCT TAA 462 
Pro Tyr Asp Val Pro Asp Tyr Ala Ser 
145 150 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Leu Glu Leu Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg Arg 

1 5 10 15 

Phe Ser Arg Ser Asp Glu Leu Thr Arg His He Arg He His Thr Gly 

20 25 30 

Gin Lys Pro Phe Gin Cys Arg He Cys Met Arg Asn Phe Ser Arg Ser 

35 40 45 

Asp His Leu Thr Thr His lie Arg Thr His Thr Gly Glu Lys Pro Phe 

50 55 60 

Ala Cys Asp He Cys Gly Arg Lys Phe Ala Arg Ser Asp Glu Arg Lys 
65 70 75 80 

Arg His Thr Lys He His Thr Xaa Gin Lys Pro Thr Ser Gly Gly Gly 

85 90 95 

Arg He Ala Arg Leu Glu Glu Lys Val Lys Thr Leu Lys Ala Gin Asn 

100 105 110 

Ser Glu Leu Ala Ser Thr Arg Asn Met Leu Arg Glu Gin Val Ala Gin 

115 120 125 

Leu Lys Gin Lys Val Met Asn His Ala Ser Gly Gin Ala Gly Gin Tyr 

130 135 140 

Pro Tyr Asp Val Pro Asp Tyr Ala Ser 
145 150 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(ix) FEATURE: 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION: 1. . .459 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

ATG CTC GAG CTC CCC TAT GCT TGC CCT GTC GAG TCC TGC GAT CGC CGC 48 
Met Leu Glu Leu Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg Arg 
15 10 15 

TTT TCT CGC TCG GAT GAG CTT ACC CGC CAT ATC CGC ATC CAC ACA GGC 96 
Phe Ser Arg Ser Asp Glu Leu Thr Arg His He Arg He His Thr Gly 
20 25 30 

CAG AAG CCC TTC CAG TGT CGA ATA TGC ATG CGT AAC TTC AGT CGT AGT 144 
Gin Lys Pro Phe Gin Cys Arg He Cys Met Arg Asn Phe Ser Arg Ser 
35 40 45 

GAC CAC CTT ACC ACC CAC ATC CGC ACC CAC ACA GGC GAG AAG CCT TTT 192 
Asp His Leu Thr Thr His He Arg Thr His Thr Gly Glu Lys Pro Phe 
50 55 60 

GCC TGT GAC ATT TGT GGG AGG AAG TTT GCC AGG AGT GAT GAA CGC AAG 240 
Ala Cys Asp He Cys Gly Arg Lys Phe Ala Arg Ser Asp Glu Arg Lys 
65 70 75 80 

AGG CAT ACC AAA ATC CAT ACC GGT CAG AAG CCC ACT AGT GGC GGT GGT 288 
Arg His Thr Lys He His Thr Gly Gin Lys Pro Thr Ser Gly Gly Gly 
85 90 95 

CTG ACC GAC ACC CTG CAG GCG GAA ACC GAC CAG CTG GAA GAC GAA AAA 336 
Leu Thr Asp Thr Leu Gin Ala Glu Thr Asp Gin Leu Glu Asp Glu Lys 
100 105 110 

TCC GCG CTG CAA ACC GAA ATC GCG AAC CTG CTG AAA GAA AAA GAA AAG 384 
Ser Ala Leu Gin Thr Glu He Ala Asn Leu Leu Lys Glu Lys Glu Lys 
115 120 125 

CTG GAG TTC ATC CTG GCG GCA CAC GCT AGC GGC CAG GCC GGC CAG TAC 432 
Leu Glu Phe He Leu Ala Ala His Ala Ser Gly Gin Ala Gly Gin Tyr 
130 135 140 

CCG TAC GAC GTT CCG GAC TAC GCT TCT TAA 462 
Pro Tyr Asp Val Pro Asp Tyr Ala Ser 
145 150 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

Met Leu Glu Leu Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg Arg 

1.5 10 15 

Phe Ser Arg Ser Asp Glu Leu Thr Arg His He Arg He His Thr Gly 

20 25 30 

Gin Lys Pro Phe Gin Cys Arg He Cys Met Arg Asn Phe Ser Arg Ser 

35 40 45 

Asp His Leu Thr Thr His He Arg Thr His Thr Gly Glu Lys Pro Phe 
50 55 60 



Ala 


Cys 
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He 


Cys 


Gly Arg 


Lys 
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Ala Arg 


Ser Asp Glu Arg Lys 
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Val 
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(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
CGCCCACGCN GCGTGGGCG 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CGCCCACGCN GCGGCGGCGG CGGCGGCG 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 76 amino acids 
(6) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(B) LOCATION: 76,..76 

(D) OTHER INFORMATION: where Xaa at position 76 is Tyr-Ala-Cys-Pro- 
Val-Glu-Ser-Cys-Asp-Arg-Arg-Phe-Ser-Lys-Ser-Ala-Asp-Leu-Lys-His-Ile-Arg-His- 
Thr-Gly-Glu-Lys-Pro-Met-Lys-Leu-Leu-Glu-Pro-repeated 10 times; residue 76=340 
amino acids; some Xaa's may be missing 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Lys Leu Leu Glu Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg 

1 . 5 10 15 

Arg Phe Ser Lys Ser Ala Asp Leu Lys Arg His He Arg His Thr Gly 
20 25 30 



Glu Lys Pro Met Lys Leu Leu Glu Pro Tyr Ala Cys Pro Val Glu Ser 

35 40 45 

Cys Asp Arg Arg Phe Ser Lys Ser Ala Asp Leu Lys His He Arg His 

50 • 55 60 

Thr Gly Glu Lys Pro Met Lys Leu Leu Glu Pro Xaa 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base. pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CCTCGCCGCC GCGGGTTTTC CCGCGCCCCC GAGG 34 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECXJLE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION: l..,294 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATG AAA CTG CTC GAG CCC TAT GOT TGC OCT GTC GAG TCC TGC GAT CGC 48 
Met Lys Leu Leu Glu Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg 
15 10 15 

CGC TTT TCT AAG TCG GOT GAT CTG AAG CGC CAT ATC CGC ATC CAC ACT 96 
Arg Phe Ser Lys Ser Ala Asp Leu Lys Arg His He Arg He His Thr 
20 25 30 

GGC GAA AAA CCG TAC GCG TGC OCT GTC GAG TCC TGC GAT CGC CGC TTT 144 
Gly Glu Lys Pro Tyr Ala Cys Pro Val Glii Ser Cys Asp Arg Arg Phe 
35 40 45 

TCT AAG TCG GOT GAT CTG AAG CGC CAT ATC CGC ATC CAC ACC GGG GAG 192 
Ser Lys Ser Ala Asp Leu Lys Arg His He Arg He His Thr Gly Glu 
50 55 60 

AAG CCC TAT GOT TGC CCT GTC GAG TCC TGC GAT CGC CGC TTT TCT AAG 240 
Lys Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg Arg Phe Ser Lys 
65 70 75 80 

TCG GOT GAT CTG AAG CGC CAT ATC CGC ATC CAC ACC GGT CAG AAG CCC 288 
Ser Ala Asp Leu Lys Arg His He Arg He His Thr Gly Gin Lys Pro 
85 90 95 



ACT AC 
Thr Thr 



293 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Lys Leu Leu Glu Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg 

15 10 15 

Arg Phe Ser Lys Ser Ala Asp Leu Lys Arg His He Arg He His Thr 

20 25 30 

Gly Glu Lys Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg Arg Phe 

35 40 45 

Ser Lys Ser Ala Asp Leu Lys Arg His He Arg He His Thr Gly Glu 

50 55 60 

Lvs Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg Arg Phe Ser Lys 
65 70 75 80 

Ser Ala Asp Leu Lys Arg His He Arg He His Thr Gly Gin Lys Pro 
85 90 95^ 

Thr Thr 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 543 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION: l..,543 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ATG CTC GAG CTC CCC TAT GOT TGC CCT GTC GAG TCC TGC GAT CGC CGC 
Met Leu Glu Leu Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg Arg 
15 10 15 

TTT TCT CGC TCG GAT GAG CTT ACC CGC CAT ATC CGC ATC CAC ACA GGC 
Phe Ser Arg Ser Asp Glu Leu Thr Arg His He Arg He His Thr Gly 
20 25 30 

CAG AAG CCC TTC CAG TGT CGA ATA TGC ATG CGT AAC TTC AGT CGT AGT 
Gin Lys Pro Phe Gin Cys Arg He Cys Met Arg Asn Phe Ser Arg Ser 
35 40 45 

GAC CAC CTT ACC ACC CAC ATC CGC ACC CAC ACA GGC GAG AAG CCT TTT 
Asp His Leu Thr Thr His He Arg Thr His Thr Gly Glu Lys Pro Phe 
50 55 60 



GCC TGT GAC ATT TGT GGG AGG AAG TTT GCC AGG AGT GAT GAA CGC AAG 
Ala Cys Asp He Cys Gly Arg Lys Phe Ala Arg Ser Asp Glu Arg Lys 
65 70 75 80 



Q 



o 



AGG CAT ACC AAA ATC CAT ACC GGG GAG AAG CCC TAT GCT TGC CCT GTC 288 
Aro His Thr Lys He His Thr Gly Glu Lys Pro Tyr Ala Cys Pro Val 
85 90 95 

GAG TCC TGC GAT CGC CGC TTT TCT CGC TCG GAT GAG CTT ACC CGC CAT 336 
Glu Ser Cys Asp Arg Arg Phe Ser Arg Ser Asp Glu Leu Thr Arg His 
100 105 110 

ATC CGC ATC CAC ACA GGC CAG AAG CCC TTC CAG TGT CGA ATA TCC ATG 384 
He Arg He His Thr Gly Gin Lys Pro Phe Gin Cys Arg He Ser Met 
115 120 125 

CGT AAC TTC AGT CGT AGT GAC CAC CTT ACC ACC CAC ATC CGC ACC CAC 432 
Arg Asn Phe. Ser Arg Ser Asp His Leu Thr Thr His He Arg Thr His 
130 135 140 

ACA GGC GAG AAG CCT TTT GCC TGT GAC ATT TGT GGG AGG AAG TTT GCC 480 
Thr Gly Glu Lys Pro Phe Ala Cys Asp He Cys Gly Arg Lys Phe Ala 
145 150 155 160 

AGG AGT GAT GAA CGC AAG AGG CAT ACC AAA ATC CAT TTA AGA CAG AAG 528 
Arg Ser Asp Glu Arg Lys Arg His Thr Lys He His Leu Arg Gin Lys 
165 170 175 

GAC TCT AGA ACT AGT 543 
Asp Ser Arg Thr Ser * 
180 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Pro Val Glu Ser Cys Asp Arg Arg 

10 15 
Arg His He Arg He His Thr Gly 

25 30 
Cys Met Arg Asn Phe Ser Arg Ser 
45 

Thr His Thr Gly Glu Lys Pro Phe 
60 

Phe Ala Arg Ser Asp Glu Arg Lys 
75 80 
Glu Lys Pro Tyr Ala Cys Pro Val 

90 95 
Arg Ser Asp Glu Leu Thr Arg His 
105 110 
Pro Phe Gin Cys Arg He Ser Met 
125 

Leu Thr Thr His He Arg Thr His 
140 

Asp He Cys Gly Arg Lys Phe Ala 
155 160 
Thr Lys He His Leu Arg Gin Lys 
170 175 



Met Leu Glu Leu 


Pro 
5 


Tyr 


Ala Cys 


1 

Phe Ser Arg Ser 


Asp 


Glu 


Leu Thr 


20 








Gin Lys Pro Phe 


Gin 


Cys 


Arg He 


35 






40 


Asp His Leu Thr 


Thr 


His 


He Arg 


50 






55 


Ala Cys Asp He 


Cys 


Gly 


Arg Lys 


65 




70 




Arg His Thr Lys 


He 


His 


Thr Gly 


85 






Glu Ser Cys Asp 


Arg 


Arg 


Phe Ser 


100 








He Arg He His 


Thr 


Gly 


Gin Lys 


115 






120 


Arg Asn Phe Ser 


Arg 


Ser 


Asp His 


130 






135 


Thr Gly Glu Lys 


Pro 


Phe 


Ala Cys 


145 




150 




Arg Ser Asp Glu 


Arg 


Lys 


Arg His 



Asp Ser Arg Thr Ser 
180 

(2). INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
GAGGAGGAGG AGGGATCCAT GCTCGAGCTC CCCTATGCTT GCCCTG 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
GAGGAGGAGA CCGGTATGGA TTTTGGTATG CCTCTTGCG 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
GAGGAGGAGA CCGGTGAGAA GCCCTATGCT TGCCCTGTCG AGTCCTGCGA TCGCCGC 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
GAGGAGGAGA CTAGTTCTAG AGTCCTTCTG TC 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
GATGTATGTA GCGTGGGCGG CGTGGGCGTA AGTAATGC 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
GATGTATGTA GCGTGGGCGG GGGCGGGGTA AGTAATGC 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
GATGTATGTA GCGGCGGCGG CGGCGGCGTA AGTAATGC 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52* 
GATGTATGTA GCGTGGGCGT AAGTAATGC 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 
GATGTATGTA GGGGCGGGGT AAGTAATGC 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
GATGTATGTA GCGTGGGCGT AAGTAATGC 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
GAGGAGGAGG AATTCCGACA TTTATAATGA ACGTGAATTG C 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
TGCGCCCACG CCGCCCACGC GATGATTGGG AGCTTTTTTT GCACG 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
TCGCGTGGGC GGCGTGGGCG CAAAAAATTA TTATCATGGA TTCTAAAACG G 
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
GAGGAGGAGG CGGCCGCAGG TAGATGAGAT GTGACGAACG TG 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
TGCCCCGCCC CCGCCCACGC GATGATTGGG AGCTTTTTTT GCACG 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
TCGCGTGGGC GGGGGCGGGG CAAAAAATTA TTATCATGGA TTCTAAAACG G 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
GCGTGGGCGG CGTGGGCG 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
GCGTGGGCGG GGGCGGGG 



